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ABSTRACT 


Based  on  right-censored  data  from  a  lifetime  distribution  Fq,  a  kernel 
type  estimator  of  the  quantile  function  Q°(p)  *  inf{t:  Fq(0  £  p},  0  S  p  &  1, 
is  proposed.  The  estimator  is  defined  by  Qn(p)  “  hfl  /q  Qn(t)K((t-p)/hn>dt , 
which  is  smoother  than  the  usual  product-limit  quantile  function 

A  A  A. 


Q^(p)  ■  inf{t:  Fn(t)  2  p},  where  Fn  denotes  the  product-limit  estimator  of 
Fq  from  the  censored  sample.  Under  the  random  censorship  model  and  general 
conditions  on  h^.K,  and  Fq,  it  is  shown  that  (^(p)  is  strongly  consistent. 
In  addition,  an  approximation  to  is  shown  to  be  asymptotically  equiv¬ 
alent  to  with  probability  one.  A  small  Monte. Carlo  simulation 

study  shows  that  for  several  values  of  the  bandwidth  h^,  performs  better 

A 

than  in  the  sense  of  estimated  mean  squared  errors.  The  estimator  is 

illustrated  by  an  application  to  data  from  a  mechanical- switch  life  test. 


Key  Words:  Random  censorship;  Product-limit  quantile  function;  Kernel 
estimation;  Median  survival  time  estimation;  Nonparametric  quantile  estimation. 
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1.  INTRODUCTION 


J  Arbitrarily  right-censored  data  arise  naturally  in  industrial  life 
testing  and  medical  follow-up  studies.  In  these  situations  it  is  important 
to  be  able  to  obtain  nonparametric  estimates  of  various  characteristics  of 
the  survival  function  S.  Based  on  such  right-censored  data,  Kaplan  and  Meier 
^^7— gave  the  nonparametric  maximum  likelihood  estimator  of  S,  called  the 
product-limit  estimator,  and,  among  others,  Reid  (1981)  has  proposed  methods 
of  estimating  the  median  survival  time  from  the  product-limit  estimator. 
Recently,  Nair  (1984)^ studied  the  problem  of  confidence  bands  for  the  survival 
function  obtained  from  the  product-limit  estimator.  Also,  Padgett  and  McNichols 
C19M}"and  McNichols  and  Padgett  (1984)  have  discussed  estimation  of  a  density 
for  the  survival  distribution  based  on  right-censored  data. 

One  characteristic  of  the  survival  distribution  that  is  of  interest  is  the 
quantile  function,  which  is  useful  in  reliability  and  medical  studies.  For  any 


probability  distribution  function  G,  the  quantile  function  is  defined  by  ^ 

Q(p)  =  G-1(p)  =  Cp  *  G(x)  2  p} ,  0  S  p  i  1.  In  particular,  ^  is  a 

median  of  G.  For  a  random  (uncensored)  sample  Y^»**’»Yn  ^Tom  G,  the  sample 

quantile  function  G  ^(p)  ■  inf{x:  G  (x)  2  p),  0  s  p  s  1,  has  been  used  to 

n  n 

estimate  £  ,  where  G  (x)  denotes  the  sample  distribution  function.  Note  that 
P  n 

Gn1(p)  ■  ^([np])*  t*ie  order  statistic  among  Y^,...,Yn,  where  |«) 

denotes  the  greatest  integer  function.  Csorgo"  (1983)  gives  many  of  the  known 


results  concerning  Gr  (p) .  Also,  Falk  (1984)  has  recently  studied  the  relative 
deficiency  of  the  sample  quantile  with  respect  to  kernel  type  estimators. 


Gther  nonparametric  estimators  of  the  quantile  function  from  uncensored 
data  have  been  proposed  which  are  smoother  than  the  sample  quantile  function. 


For  example,  Kaigh  and  Lachenbruch  (1962)  considered  a  "generalized  sample 
quantile"  obtained  by  averaging  an  appropriate  subsample  quantile  over  all 
aubsaaples  of  a  given  size.  Also  recently,  Yang  (1984)  has  studied  the 
properties  of  kernel-type  estimators  of  which  smooth  the  sample  quantile 

function.  Parzen  (1979)  had  mentioned  kernel  estimators  as  a  possible  class 
of  quantile  estimators,  but  did  not  investigate  their  properties. 

For  arbitrarily  right-censored  data,  Sander  (1975)  proposed  estimation  of 

£  by  the  quantile  function  of  the  product-limit  estimator  of  1-S,  and  she 

P 

and  Cheng  (1981)  obtained  some  asymptotic  properties  of  that  estimator.  Csorgo' 
(1983)  presented  strong  approximation  results  for  this  estimator. 

— The  quantile  function  of  the  product-limit  estimator  is  a  step  function 
with  jumps  corresponding  to  the  uncensored  observations.  The  purpose  of  this 
paper  is  to  present  a  smoothed  nonparametric  estimator  of  the  quantile  function 
from  arbitrarily  right-censored  data  based  on  the  kernel  method.  It  will  be 
shown  that  under  general  conditions  this  estimator,  mentioned  briefly  by  l’arzen 
(1979,  p.  119),'  is  strongly  consistent,  and  based  on  the  results  of  a  small 
Monte  Carlo  simulation  study,  performs  better  than  the  quantile 
function  of  the  product-limit  estimator  in  the  sense  of  smaller  mean  squared 
error.  In  particular,  better  estimates  of  the  median  survival  time  are  obtainable 
In  addition,  an  approximation  to  the  kernel  estimator  will  be  shown  to  be  almost 
surely  asymptotically  equivalent  to  it  under  certain  conditions.  Finally, 
estimates  of  the  quantile  function  from  the  randomly  right-censored  data  given 
by  Nalr  (1984)  are  presented  as  an  illustration. 


2.  ARBITRARILY  RIGHT-CENSORED  DATA 


Let  X°,...,X°  denote  the  true  survival  times  of  n  items  or  individuals 
1  n 

which  are  censored  on  the  right  by  a  sequence  . Uq,  which  in  general 

may  be  either  constants  or  random  variables.  It  is  assumed  that  the  X°'s 

are  nonnegative  independent  identically  distributed  random  variables  with  common 

unknown  distribution  function  Fq  and  unknown  quantile  function 

Q°(p)  =  =  inf{t:  FQ(t)  ip},0SpSl,  Also,  Q°(p)  is  sometimes  denoted 

by  f"1^). 

The  observed  right-censored  data  are  denoted  by  the  pairs  (X.,A.), 

1*1,..., n,  where 

II  if  X°  <  Uj 
0  if  X°  >  U±  . 


Thus,  It  is  known  which  observations  are  times  of  failure  or  death  and  which 

ones  are  censored  or  loss  times.  The  nature  of  the  censoring  depends  on  the 

U^'s.  (i)  If  U^,..,,Un  are  fixed  constants,  the  observations  are  time- 

truncated.  If  all  U^'s  are  equal  to  the  same  constant,  then  the  case  of 

Type  I  censoring  results,  (ii)  If  all  *  ^(r)  ’  t*ie  r—  or<*er  statistic 

of  X°,...,X°,  then  the  situation  is  that  of  Type  II  censoring,  (iii)  If 
1  n 

U. ,...,U  constitute  a  random  sample  from  a  distribution  H  (usually  unknown) 
I  n 

and  are  independent  of  X°,...,X°,  then  (X^,A^),  i*l,2,..,,n,  is  called  a 
randomly  right-censored  sample. 

The  random  censorship  model  (iii)  is  assumed  for  the  results  presented 
here.  For  this  model,  A^,...,An  are  independent  Bernoulli  random  variables, 
and  the  distribution  function  F  of  each  X^,  i*l,...,n,  is  given  by 


Based  on  the  censored  sample  (X^,Ap,  i=l,2,...,n,  a  popular  estimator 
of  the  survival  function  l-F^t)  at  t  £  0  is  the  product-limit  estimator, 
proposed  by  Kaplan  and  Meier  (1958)  as  the  "nonpararaetric  maximum  likelihood 
estimator".  Efron  (1967)  showed  that  this  estimator,  defined  next,  is  "self- 
consistent".  Let  (Z^Ap,  i*l,...,n,  denote  the  ordered  X/s  along  with 
their  corresponding  A^'s.  A  value  of  the  censored  sample  will  be  denoted  by 
the  corresponding  lower  case  letters  (x^.dp  and  for  unordered 

and  ordered  sample,  respectively.  Then  the  product-limit  estimator  of 
l-Fg(t)  is  defined  by 


Pn(t)  = 


1,  0  s  t  <  zlt 

(^S?r)6i»  Zk-X " c '  v  k=2 . n 


z  <  t. 

n 


Denote  the  product-limit  estimator  of  F_(t)  by  F  (t)  *  1  -  P  (t),  and  let 

u  n  n 

A  A 

denote  the  jump  of  P  (or  F  )  at  Z.,  that  is 
j  n  n  j 


1  -  P(Z_), 
n  L 


Pn(zp  -  Pn(Zj+1),  j-2 . n-1 


p  ( Z  )  , 
n  n 


Note  that  s  *  0  if  and  only  if  6'  *  0,  j  <  n,  that  is,  if  Z.  is  a  censored 
3  „  J  i  3 

observation.  Also,  denote  =  Fn(zp  ■  £  s j »  1=1*2, .. .,n. 

The  product-limit  estimator  has  played  a  central  role  in  the  analysis  of 
censored  survival  data  (Miller,  1981).  Its  properties  have  been  studied  by 
many  authors,  for  example,  Breslow  and  Crowley  (1974),  Foldes  and  RejtK  (1981), 


Foldes,  Rejto  and  Winter  (1980),  and  Gill  (1983). 


Based  on  randomly  right-censored  data,  it  is  natural  to  estimate  the 
quantile  function  Q°(p)  by  the  product-limit  (PL)  quantile  function 

*  *  A 

Q^(p)  =  £  =  inf{t:  ®n(t)  ^  p}»  Cheng  (1981)  obtained  asymptotic  normality 

*  o 

results  for  and  gave  an  asymptotic  expression  for  in  terms  of 


F  ,  and  f_,  a  density  function  of  F-.  Csorgb"*  (1983)  presented  strong  approxi 
n  0  U 

A 

mat ion  theorems  for  the  PL  quantile  process  Qn> 


3.  THE  QUANTILE  ESTIMATOR 


In  this  section  the  kernel  estimator  of  Q°(p),  0  S  p  i  1,  from  the 
randomly  right-censored  observations  (X^,A^),  i=l,...,n,  will  be  defined. 
Similar  to  Yang's  (1984)  estimators  for  the  uncensored  case,  an  approximation 
which  is  often  easier  to  compute  will  be  given.  First,  some  assumptions  and 
notation  concerning  the  kernel,  the  bandwidth  sequence,  and  the  lifetime  and 
censoring  distributions  will  be  listed. 

Let  (h  }  be  a  "bandwidth"  sequence  of  positive  numbers  such  that 


(h.l) 

h  0  as  n  -► 

n 

K  be  a 

real-valued  function  defined  on 

(-“>,«)  such  that 

(K.l) 

K(x)  £  0,  all  real  numbers 

x. 

(K.2) 

Co  *<*>**  -  1. 

(K.3) 

K  has  finite  support,  that 

is. 

K(x)  *  0  for  |x|  >  c 

for  some  c  >  0, 

(K.4) 

sup  j K(x) J  <  ®,  that  is,  K 

is 

bounded , 

7 


(K.5)  K  is  symmetric  about  zero,  and 

(K.6)  K  satisfies  a  Lipschitz  condition,  that  is,  there  exists 
a  constant  T  such  that  for  all  x,y, 
jK(x)  -  K(y)  |  sT]x  -  y| . 

Notice  that  conditions  (K.l)  -  (K.2)  simply  say  that  K  must  be  a  profcib.  ii 
■1 insity  function.  Also,  assume  that  the  lifetime  distribution  Fq  ar.J  the 
censoring  distribution  H  are  such  that 

(F.l)  Fq  is  continuous  with  density  function  f^, 

(F.2)  fn  is  continuous  at  £°  =  Q°(p)  and  f-(£°)  >  0, 

u  p  Up 

(F.3)  Fq  has  a  finite  mean,  and 

(F.A)  H(T  )  <  1,  where  T„  *  sup{t:  F-(t)  <  l} . 

F0  r0  0 

It  should  be  noted  that  the  conditions  (F.l)  -  (F.A)  are  not  prohi'nx  ivc. 
and  are  similar  to  those  assumed  by  Cheng  (1981).  Condition  (F.A)  is  usually 
required  for  asymptotic  results  with  random  right-censorship  and  guarantees 
that  observations  can  be  obtained  from  the  entire  support  of  the  distribution 
Fq.  The  conditions  (h.l),  (K.3  -  K.6),  and  (F.2)  -  (F.A)  are  requi^t 
the  asymptotic  results  of  Section  A. 

Now,  for  0  £  p  £  1,  define  the  kernel  type  quantile  function  est  it- ■ 
Vp)  '  hn1  ij  V'>K«t-p)/hn)dt 

’  "n1  l  h  V  t«t-P>/Vdt- 

i-1  i-1 


(3.1) 


jncsn- 


where  S.  *  F  (Z.).  It  should  be  noted  that  only  those  Z,  which  are 
ini  i 

sored  appear  in  the  sun  (3.1)  since 


S. 

/-1  K((t-p)/h  )dt  = 

Vl  n 


0,  if  Z^  is  censored 

h  [K*((S  -p)/h  )  -  K*((S.  -p)/h  ], 

n  i  n  l—i  n 


if 


is  uncensored. 


where  K  denotes  the  cumulative  distribution  function  of  K. 

An  approximation  to  the  estimator  (3.1)  can  be  obtained  by  noticing  that 

* 

the  derivative  of  K  at  (S^-p)/hn  is  approximated  by 

(hn/s1)  [K*((S±  -  P)/hn)  -  K*((Si_1 -p)/hn)  «  K((Si-p)/hn). 


Hence,  when  is  small,  (3.1)  is  approximately  equal  to 


Again,  since  *  0  when  6^  =  0,  i  <  n,  only  the  uncensored  observations  explicitly 
appear  in  the  sum  (3.2). 

In  the  case  of  no  censoring,  (3.1)  and  (3.2)  reduce  to  the  kernel  estimators 
of  Yang  (1984).  He  has  shown  that  his  estimators  are  asymptotically  equivalent 
in  mean  square  and  obtained  rates  of  convergence  for  the  variance  and  hi--; 

Due  to  the  censoring,  similar  results  for  the  variance,  bias,  and  mem:  :  . 

consistency  of  (3.1)  and  (3.2)  seem  to  be  difficult,  if  not  impossible' > to  cui  m 
under  general  conditions  on  and  H.  Some  asymptotic  results,  however,  hav- 

been  obtained  under  reasonable  conditions  and  are  presented  in  the  next  section. 


9 


4.  ASYMPTOTIC  RESULTS 


Here  two  asymptotic  results  for  and  will  be  obtained. 
First,  the  almost  sure  consistency  of  the  kernel  estimator  O^/p)  is 
stated.  The  proofs  of  Theorems  1  and  2  are  given  in  the  Appendix. 


Theorem  1.  Suppose  the  conditions  (h.l),  (K.l)  -  (K.6),  and  (F.l; 

3/4  -1 

(F.4)  hold.  If  (log  log  n/n)  h^  -+  0  as  n  +  %  then  for  each 
0  <  p  <  1,  Q^(p)  "*■  Q°(p)  as  n  ■+•  00  with  probability  one. 

The  two  estimators  and  can  be  shown  to  be  asymptotically 

almost  surely  (uniformly  in  p)  equivalent  under  general  conditions. 


First,  define  u(t)  £  P  (x)dx,  t  >  0. 

u  n  — 


Theorem  2.  Suppose  and  H  are  continuous  and  that  (h.l),  (KJ), 

(K.2) ,  (K.6),  (F.3),  and  (F.4)  hold.  If  lim  sup  y(Z  )  <  °°  with 

n  n 

probability  one,  then 

p:  !Q^(p)  -  Q^p)  |  ”  0(h~2  (log  log  n/n)*5)]  =  1. 

-2  h 

Thus,  if  hR  (log  log  n/n)  -*■  0  as  n  then  under  the  above 

conditions  and  are  (uniformly  in  p)  asymptotically  equivalent 

with  probability  one. 


It  should  be  remarked  that  lim  sup  y(Z  )  <  00  almost  surely 

n  n 

under  the  conditions  given  by  Susarla  and  Van  Ryzin  (1980),  for  example 


Also,  if  -  D  n  ^  for  0  <  b  <  h  and  some  positive  constant 

-2  % 

D,  the  condition  that  hR  (log  log  n/n)  -*■  0  holds.  It  seems  to 
be  quite  difficult  to  obtain  the  exact  mean  squared  error  of 
or  and  to  be  able  to  choose  {tO  to  minimize  this  mean  squared 

error  or  to  choose  an  optimal  (lO  in  some  other  sense.  Some 
simulation  results  presented  in  the  next  section  indicate  a  range 
of  possible  h^  values  for  which  the  mean  squared  errors  of 
(and  Q*)  are  less  than  those  of  the  PL  quantile  estimator. 


5.  SOME  SIMULATION  RESULTS  AND  AN  EXAMPLE 

A  small  Monte  Carlo  study  was  performed  in  order  to  provide  some 

small-sample  comparisons  of  and  with  the  PL  quantile  estimator, 

and  with  each  other,  in  the  sense  of  mean  squared  errors.  The  study 

also  provides  some  insight  into  the  choice  of  reasonable  values  for 

h  which  might  be  used  in  practice.  The  random  censorship  model 
n 

with  Fq(0  *  1  -  exp(-t)  and  H(t)  =  1  -  exp(-Xt)  was  used  with  X 
chosen  to  give  50%  censoring  or  30%  censoring  as  in  Reid  (1981).  The 

A 

ratios  of  the  mean  squared  error  of  Q„(p)  to  the  mean  squared  errors 

of  the  smoothed  estimators  (^(p)  and 

0  <  p  <  1  and  sample  sizes  n  *  50  and  100.  For  each  case,  1000  censored 

samples  were  generated  using  the  uniform  random  number  generator  GGUBS 

in  the  International  Mathematical  and  Statistical  Libraries  (1982)  on 

a  DEC  VAX  11-750  computer.  The  standard  errors  of  the  simulated  mean 

-1  -4 

squared  errors  ranged  from  10  to  10 


Q^(p)  were  computed  for  various; 


Table  1  shows  some  of  the  results  for  the  triangular  kernel 

K(x)  ■  1  -  |xj,  |x|  <  1,  which  satisfies  the  conditions  (K.l)  -  (K.6) 

of  Section  3.  The  simulations  were  run  for  values  of  h  ■  0.01  (0.02) 

n 

0.61.  For  the  estimator  (^(p),  for  each  value  of  p  listed  there  is 


an  h  which  gives  smaller  estimated  mean  squared  error  than  the  nj 
n 


quantile  estimator.  In  particular,  this  is  true  for  several  h 


values  for  the  median  estimators  0^(0. 5)  and  Qn(0.5).  The  appiox: 
matlon  Q^(p)  performs  well  for  several  h^  values  when  p  £  0.5, 
but  not  so  for  larger  p.  As  would  be  expected  for  more  severe 
censoring,  the  performance  of  either  estimator  at  large  values  of  p 


is  not  as  good  as  for  values  near  0.5.  Notice  that  h  values  of 

n 


0.09  to  0.13  appear  to  be  best  for  (^(p)  over  all  p  in  Table  ).(«) 
with  n  *  100,  whereas  for  Q^(p)  the  hn  should  be  somewhat  larger 
(0.15  to  0.21)  for  a  good  estimator  over  all  p.  Generally,  the  best 
hR  for  Q^(p)  is  larger  than  that  for  Qn(p),  indicating  that  Q^* 
requires  slightly  more  smoothing  than  Q^. 

The  results  of  Table  2  are  for  the  uniform  kernel  K(x)  *  1, 

|x|  *s.  This  kernel  does  not  satisfy  condition  (K.6),  but  the  sim¬ 

ulation  results  are  quite  similar  except  perhaps  for  the  best  choicer 


of  values  of  h  . 

n 


It  was  mentioned  in  Section  4  that  it  is  difficult  if  not  impos-nib 
to  calculate  in  general  the  exact  mean  squared  error  of  or 


for  small  n  due  to  the  right -censorship.  Also, 


the  mean  square  convergence  (with  a  rate) 


has  not  yet  been  obtained.  Hence,  to  find  an  optimal  in  the  sei;  of 

A 

minimum  mean  squared  error  of  or  seems  to  be  quite  difficult.  These 

simulations,  however,  indicate  reasonable  ranges  of  h^  values  which  give 

small  mean  squared  errors  under  the  assumed  models  and  censoring  percentages. 

As  an  example  of  the  quantile  estimators,  the  life  test  data  for  a  -  40 

mechanical  switches  reported  by  Nair  (1984)  are  used.  Two  failure  modes,  A  and  B 

were  recorded  and  Nair  (1984)  estimated  the  survival  function  of  mode  A, 

assuming  the  random  right-censorship  model.  Table  3  shows  the  40  observations 

with  the  corresponding  6^  values  (6^ *  1  indicates  failure  mode  A  and  6^ « 0 

denotes  a  censored  value).  There  are  seventeen  uncensored  observations,  slightly 

more  than  50%  censoring.  From  Table  1(a)  with  n  = 50,  the  values  of  h  chosen 

”  n 

for  this  example  were  0.03  for  Q  (p)  and  0.15  for  Q*(p).  (Also,  respective 

n  n 

values  of  0.05  and  0.19  were  tried  yielding  similar  estimates.)  Figure  .1 

ft 

shows  the  estimates  Qn(p)  an^  Q^P) ,  calculated  using  the  triangular  kernel, 

A 

along  with  the  PL  quantile  function  Q  (p) .  Due  to  the  large  number  of  censored 

n 

observations,  the  estimates  for  large  p  reflect  the  small  estimated  mean 


squared  error  ratios  in  Tables  1  and  2  for  p  2  0.90.  In  particular,  the 

* 

estimate  is  not  very  smooth  for  moderate  to  large  p  and  could  be  smoothed 

more  by  taking  larger  hR,  say  hR  -  0.35.  However,  as  indicated  by  Table  1, 

the  performance  deteriorates  for  larger  p  with  such  h  ,  and  the  estimate  Q 

n*  n 

A 

falls  much  below  and  Qn  for  this  data.  The  estimates  of  median  lifetime 
are  Q_(0.50)  -  2.5478,  Q*(0.5)  =  2.4354,  and  Q_(0.5)  =  2.5480. 


§ 
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TABLE  1.  Ratios  of  MSEs  with  Triangular  Kernel 

(a)  50%  Censoring 
n  *  100 

.03  .05  .07  .09  .11  .13  .15  .19  .21  .25  .31 


a. 

1.16 

1.18 

1.31 

1.34 

1.43 

1.40  1.51 

1.40 

1.27 

1.17 

0.84 

0.65 

0.38 

b. 

1.29 

1.37 

1.52 

1.55 

1.66 

1.68  1.85 

1.75 

1.66 

1.58 

1.15 

0.88 

0.50 

a. 

1.04 

1.07 

1.09 

1.12 

1.16 

1.21  1.16 

1.23 

1.22 

1.17 

1.15 

1.08 

0.93 

b. 

1.17 

1.20 

1.22 

1.27 

1.31 

1.35  1.35 

1.40 

1.44 

1.43 

1.50 

1.47 

1.40 

a. 

1.04 

1.07 

1.10 

1.12 

1.14 

1.17  1.14 

1.16 

1.15 

1.10 

1.03 

0.89 

0.73 

b. 

1.14 

1.24 

1.34 

1.34 

1.39 

1.47  1.46 

1.50 

1.61 

1.72 

1.74 

1.85 

1.83 

a. 

1.07 

1.19 

1.23 

1.41 

1.41 

1.31  1.20 

1.43 

1.34 

1.31 

1.60 

1.85 

2.64 

b. 

0.33 

0.63 

1.16 

1.53 

1.51 

1.69  1.27 

1.47 

1.31 

1.30 

2.40 

2.66 

2.65 

a. 

1.11 

1.13 

1.23 

1.30 

1.40 

1.51  1.45 

1.14 

0.94 

0.92 

0.50 

0.51 

0.37 

b. 

0.10 

0.16 

0.21 

0.22 

0.37 

0.64  0.79 

0.76 

0.70 

0.76 

0.45 

0.48 

0.36 

a. 

1.02 

1.02 

0.88 

0.69 

0.57 

0.49  0.41 

0.34 

0.29 

0.28 

0.22 

0.22 

0.19 

b. 

0.09 

0.10 

0.37 

0.55 

0.57 

0.54  0.49 

0.40 

0.33 

0.31 

0.24 

0.24 

0.20 

n  *  50 

a. 

1.08 

1.12 

1.22 

1.26 

1.29 

1.38  1.41 

1.47 

1.34 

1.24 

0.91 

0.87 

0.59 

b. 

1.48 

1.52 

1.58 

1.75 

1.79 

1.92  2.06 

2.17 

2.04 

1.97 

1.54 

1.43 

0.98 

a. 

1.04 

1.09 

1.11 

1.13 

1.16 

1.16  1.19 

1.20 

1.22 

1.21 

1.21 

1.17 

0.97 

b. 

1.27 

1.32 

1.32 

1.48 

1.45 

1.53  1.57 

1.57 

1.71 

1.71 

1.90 

1.88 

1.80 

a. 

1.09 

1.07 

1.07 

1.10 

1.15 

1.17  1.26 

1.21 

1.10 

1.21 

1.04 

0.96 

0.97 

b. 

0.34 

1.08 

1.34 

1.57 

1.73 

1.92  2.09 

1.89 

2.00 

2.40 

2.14 

2.31 

2.25 

a. 

1.13 

1.10 

1.19 

1.19 

1.33 

1.29  1.41 

1.56 

1.42 

1.62 

1.92 

2.35 

2.65 

b. 

0.17 

0.36 

0.61 

0.76 

1.07 

0.93  0.95 

1.29 

0.98 

1.22 

2.16 

2.74 

2.20 

a. 

1.06 

1.11 

1.13 

1.15 

1.16 

1.16  1.08 

0.90 

0.77 

0.69 

0.51 

0.48 

0.34 

b. 

0.11 

0.13 

0.17 

0.16 

0.25 

0.45  0.60 

0.70 

0.63 

0.63 

0.50 

0.48 

0.34 

a. 

1.00 

0.99 

0.88 

0.75 

0.63 

0.55  0.51 

0.41 

0.39 

0.36 

0.33 

0.33 

0.29 

b. 

0.12 

0.13 

0.47 

0.66 

0.67 

0.68  0.68 

0.55 

0.53 

0.46 

0.38 

0.36 

0.32 

(MSE  Fn1 )/(MSE  Qn), 

b  * 

(MSE  F;1)/(MSE  Q*) 
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|  TABLE  1.  Ratios  of  MSEs  with  Triangular  Kernel 

(b)  30%  Censoring 

•;  n  ■  100 


■ 

9 

.03 

in 

o 

• 

.07 

.09 

.11 

.13 

in 

H 

• 

.19 

.21 

.25 

.31 

.35 

.41 

.10 

a. 

1.10 

1.15 

1.24 

1.26 

1.32 

1.34 

1.41 

1.37 

1.19 

1.11 

0.80 

0.62 

0.35 

b. 

1.22 

1.30 

1.39 

1.44 

1.50 

1.55 

1.66 

1.64 

1.49 

1.42 

1.02 

0.78 

0.43 

■ 

is 

.25 

a. 

1.04 

1.08 

1.09 

1.13 

1.14 

1.17 

1.17 

1.20 

1.21 

1.22 

1.22 

1.09 

0.94 

1 

b. 

1.09 

1.16 

1.17 

1.22 

1.22 

1.25 

1.28 

1.29 

1.35 

1.38 

1.43 

1.31 

1.19 

.50 

a. 

1.04 

1.05 

1.07 

1.09 

1.10 

1.11 

1.12 

1.16 

1.09 

1.11 

1.06 

0.94 

0.74 

b. 

1.10 

1.11 

1.15 

1.19 

1.20 

1.21 

1.22 

1.26 

1.24 

1.29 

1.29 

1.22 

1.05 

s 

.75 

a. 

1.02 

1.15 

1.10 

1.10 

1.08 

1.11 

1.12 

1.05 

1.00 

0.79 

0.82 

0.98 

1.41 

i 

b. 

1.02 

1.38 

1.27 

1.33 

1.35 

1.40 

1.43 

1.53 

1.57 

1.50 

1.39 

1.46 

1.68 

.90 

a. 

1.11 

1.23 

1.34 

1.21 

1.23 

1.67 

1.71 

1.73 

1.21 

1.05 

0.55 

0.52 

0.36 

b. 

0.69 

0.97 

1.25 

1.15 

1.34 

2.07 

1.72 

1.61 

1.10 

0.98 

0.53 

0.50 

0.35 

.95 

a. 

1.23 

1.27 

1.61 

1.66 

1.18 

0.94 

0.69 

0.51 

0.40 

0.34 

0.27 

0.  26 

0.21 

1 

b. 

0.32 

0.38 

1.18 

1.50 

1.08 

0.94 

0.70 

0.53 

0.42 

0.36 

0.28 

0.27 

0.22 

n  *  50 

1 

rt 

.10 

a. 

0.95 

1.00 

1.06 

1.12 

1.15 

1.21 

1.20 

1.34 

1.17 

1.17 

0.83 

0.77 

0.55 

b. 

1.17 

1.26 

1.29 

1.44 

1.46 

1.54 

1.59 

1.80 

1.61 

1.66 

1.23 

1.10 

0.77 

>•- 

.25 

a. 

1.07 

1.07 

1.08 

1.12 

1.14 

1.10 

1.17 

1.17 

1.21 

1.18 

1.23 

1.23 

1.12 

b. 

1.21 

1.21 

1.18 

1.28 

1.29 

1.27 

1.35 

1.37 

1.45 

1.42 

1.59 

1.60 

1.55 

.50 

a. 

1.05 

1.04 

1.08 

1.09 

1.11 

1.08 

1.12 

1.18 

1.08 

1.13 

1.10 

1.06 

0.93 

***• 

b. 

1.13 

1.12 

1.21 

1.24 

1.28 

1.25 

1.33 

1.40 

1.42 

1.42 

1.48 

1.56 

1.46 

"1" 

.75 

a. 

1.08 

1.07 

1.12 

1.14 

1.13 

1.15 

1.12 

1.04 

1.25 

0.92 

1.11 

1.34 

3  .79 

ji*  ■ 

b. 

0.62 

1.16 

1.24 

1.42 

1.50 

1.54 

1.74 

1.65 

2.11 

1.64 

2.01 

3 .83 

2.15 

■.  «✓ 

.90 

a. 

1.10 

1.24 

1.39 

1.23 

1.42 

1.69 

1.83 

1.84 

1.59 

1.22 

0.94 

0.67 

0.51 

V 

b. 

0.22 

0.53 

0.76 

0.65 

1.08 

1.53 

1.78 

1.68 

1.44 

1.17 

0.92 

0.66 

0.50 

¥ 

.95 

a. 

1.13 

1.24 

1.21 

1.14 

0.96 

0.83 

0.65 

0.49 

0.42 

0.39 

0.36 

0.28 

0.24 

ft* 

b. 

0.19 

0.21 

0.71 

1.04 

0.98 

0.91 

0.72 

0.56 

0.47 

0.42 

0.39 

0.30 

0.25 

(USE  Fn_1)/(MSE  Qn),  b  ■  (MSE  F_1)/(MSE  Q*) 


\w*. 
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• 

: 

; 

TABLE  2.  Ratios  of  MSEs  with  Uniform 

Kernel 

• 

(a)  50%  Censoring  (n»100) 

< 

.03 

.05 

.07 

.09 

.10 

.13 

.15  .20 

.25 

.30 

.35 

.40 

• 

.10 

A. 

1.12 

1.13 

1.25 

1.26 

1.29 

1.30 

1.40  1.52 

1.53 

1.50 

1.19 

0.99 

b. 

0.83 

1.10 

1.35 

1.41 

1.42 

1.50 

1.64  1.81 

1.88 

1.90 

1.57 

i .  35 

.25 

a. 

1.02 

1.05 

1.06 

1.08 

1.10 

1.15 

1.13  1.17 

1.24 

1.24 

1.29 

1.15 

b. 

0.66 

0.83 

1.03 

1.08 

1.12 

1.17 

1.23  1.30 

1.43 

1.42 

1.47 

1.40 

• 

* 

.50 

a. 

1.03 

1.06 

1.09 

1.10 

1.06 

1.13 

1.13  1.12 

1.17 

1.16 

1.18 

1.03 

V 

b. 

0.26 

0.54 

0.83 

0.95 

0.96 

1.07 

1.17  1.31 

1.32 

1.40 

1.60 

1.50 

.75 

a. 

1.06 

1.14 

1.19 

1.35 

1.13 

1.25 

1.14  1.32 

1.52 

1.48 

1.31 

1.16 

b. 

0.13 

0.27 

0.54 

0.74 

0.73 

1.02 

0.84  1.15 

1.39 

1.28 

1.21 

0.88 

A 

.90 

a. 

1.08 

1.08 

1.17 

1.21 

1.26 

1.35 

1.47  1.46 

1.22 

0.93 

0.71 

0.50 

b. 

0.06 

0.10 

0.17 

0.20 

0.22 

0.23 

0.23  0.20 

1.32 

1.12 

0.83 

0 ,  56 

.  ‘  * 

.95 

a. 

1.02 

1.01 

1.02 

1.01 

1.01 

0.68 

0.53  0.36 

0.29 

0.26 

0.24 

0.22 

b. 

0.08 

0.10 

0.09 

0.10 

0.10 

0.53 

0.73  0.53 

0.38 

0.31 

0.27 

0.24 

ft 

(b)  30%  Censoring  <n*100) 

V- 

.10 

a. 

1.08 

1.07 

1.18 

1.19 

1.22 

1.24 

1.34  1.46 

1.46 

1.40 

1.14 

0.92 

ft... 

— 

b. 

1.14 

1.28 

1.39 

1.41 

1.17 

1.40 

1.56  1.73 

1.73 

1.72 

1.43 

1.18 

.25 

a. 

1.03 

1.07 

1.07 

1.09 

1.09 

1.12 

1.15  1.17 

1.26 

1.22 

1.26 

1.19 

b. 

0.61 

0.90 

1.03 

1.07 

1.09 

1.16 

1.19  1.26 

1.35 

1.34 

1.41 

1 . 35 

'  'V 

.50 

a. 

1.04 

1.04 

1.05 

1.07 

1.08 

1.08 

1.10  1.13 

1.14 

1.13 

1.13 

1.08 

• 

-  < 

b. 

0.47 

0.74 

0.90 

0.94 

1.03 

1.09 

1.09  1.17 

1.21 

1.26 

1.32 

1.27 

.75 

a. 

1.01 

1.02 

1.08 

1.08 

1.04 

1.11 

1.18  1.10 

1.03 

0.91 

0.80 

0.71 

:;:j 

b. 

0.24 

0.56 

0.76 

0.93 

0.87 

1.11 

1.26  1.20 

1.29 

1.27 

1.21 

1.28 

• 

.90 

a. 

1.07 

1.23 

1.28 

1.19 

1.21 

1.35 

1.24  1.34 

1.77 

1.07 

0.78 

0.43 

'  < 

b. 

0.23 

0.44 

0.69 

0.80 

0.77 

0.99 

0.94  0.75 

1.82 

1.21 

0.85 

0.46 

;  -S- 

.95 

a. 

1.16 

1.27 

1.35 

1.53 

1.51 

1.55 

0.96  0.59 

0.46 

0.32 

0.28 

0.24 

b. 

0.18 

0.31 

0.37 

0.38 

0.29 

1.63 

1.19  0.72 

0.51 

0.35 

0.30 

0.25 

A 

A  » 

(mse  f;1 

)  / (MSE  Qn ) 

b  ■ 

(MSE 

F^mMSE  Q*) 

1.151 

0 

1.667 

1 

.2.119 

0 

2.547 

1 

1.170 

0 

1.695 

1 

2.135 

1 

2.548 

1 

1.248 

0 

1.710 

1 

2.197 

1 

2.738 

0 

1.331 

0 

1.955 

0 

2.199 

0 

2.794 

1 

1.381 

0 

1.965 

1 

2.227 

1 

2.883 

0 

1.499 

1 

2.012 

0 

2.250 

0 

2.883 

0 

1.508 

0 

2.051 

0 

2.254 

1 

2.910 

1 

1.543 

0 

2.076 

0 

2.261 

0 

3.015 

1 

1.577 

0 

2.109 

1 

2.349 

0 

3.017 

1 

1.584 

0 

2.116 

0 

2.369 

1 

3.793 

0 

6. 

CONCLUSION 

The  kernel  type  quantile  estimator  given  in  this  paper  (and  the  approxi¬ 
mate  estimator)  are  smoother  than  the  PL  quantile  function  which  has  been 
used  for  estimation  from  right-censored  data  in  the  past.  Based  on  the  sural 1 
Monte  Carlo  simulation  study,  however,  the  approximate  estimator  Q^(p)  does 
not  seem  to  perform  overall  as  well  as  Q^P)  for  p  >  0.5,  even  though  t  )><■ 
two  estimators  are  asymptotically  equivalent  almost  surely  under  the  state -d 
conditions.  Thus,  Q^(p)  seems  to  be  the  better  small-sample  estimator. 

The  Integrals  Involved  in  computing  Q^P)  *re  easily  calculated  for  the 
simple  kernels  used  in  Section  5. 

Still  under  study  is  the  problem  of  mean  square  convergence  of  the  esi  i 
mators  (^(p)  end  Qn(p)  along  with  possible  rates.  Also,  the  choice  o/  a>> 
optimal  bandwidth  sequence  {h^}  in  the  sense  of  minimum  mean  squared  or.  < 
or  minimum  bias  is  still  being  Investigated.  However,  a  practical  choice  of 
h  can  be  obtained  based  on  results  of  simulations  such  as  those  given  in 

U 

Tables  1  and  2.  For  a  particular  set  of  right-censored  data,  hR  can  eai  il\ 
be  changed  computationally  to  give  as  "smooth"  an  estimate  as  desired. 
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APPENDIX 


The  proofs  of  Theorems  1  and  2  are  presented  here. 


Proof  of  Theorem  1.  First,  write 
h”1  Jj  (^(OKCU-pJ/h^dt  -  Q°(p) 

-  /q  !Qn(t)  -  Q°(t) Jh"1  K((t-p)/hn)dt 

+  th”1  /J  Q°(t)K((t-p)/hn)dt  -  Q°(p) ] 


Now,  following  the  first  part  of  the  proof  of  Yang's  (1984)  Theorem  ]  wif 
the  8 ample  quantile  function  replaced  by  Q under  the  assumptions  givor, » 
expression  I  can  be  integrated  by  parts  to  obtain 
F  (x) 

1  ■  -  Q  </,"(„)  »„  K<(t-p)/hn)dC}to 

±  ro  fFn(x)  -  Fq(x) Ih”1  K((F0(x)  -  p)/hn)dx 

A 

«oo  —1  A 

-  -  \  K((t-P)/hn)dt  -  |Fn(,) 

-  ty*>]h‘J  K((FQ(x)  -  p)/hn)}dx 

-  ^  tFn<*>  -  Fo(x)lhn1  K<<Vx)  -  P)/hn)dx 


l2  * 


Using  condition  (F.4)  and  the  law  of  the  Iterated  logarithm  (LIL)  result  for 


F  given  by  Foldes  and  Rejto  (1981),  the  same  argument  used  to  obtain  Yang' 


(1984)  inequality  (4)  can  be  used  to  show  that 


3/2 

1 1_  |  £  0(max{1°^V'>^n  ,  - })  almost  surely.  (A.l) 

1  nh  si  i.i 

n  n  n 

n 

Also,  with  probability  one, 

|I2I  s  II F,  -  fq||  h;1  K((y-p)/hn)/£0(Q°(y))dy, 

/V  /V 

where  ]]  Fn  -  FQ||  =  sup|Fn(x)  -  FQ(x) | ,  and  thus,  by  conditions  (h.l), 

X 

(K.3),  and  (F.2)  and  again  using  the  LIL  for  F^  (with  condition  (F.4)) 
it  can  be  shown  that  asymptotically 

1 |  *  with  probability  one.  (A. 2) 

Finally,  using  conditions  (h.l),  (K.l)  -  (K.5>,  (F.l),  (F.3),  arid  (F.^) 
ar.d  Theorem  1A  of  Parzen  (1962),  it  fellows  that 

1 13 1  -  1/q  Q°(t)hn"*1  K((t-p)/hn)dt  -  Q°(p)  i  -  o(l).  (A-  3) 

3/4  -1 

Therefore,  combining  (A.l),  (A. 2),  and  (A. 3),  since  (loglogn/n)  hR 

3/2  -1 

as  n  -*■  00  implies  that  (log  log  n)/(n  hQ )  •*  0  and  (loglogn/n)  hn  -*■ 
as  n  -*■  «,  Theorem  1  is  proved.  /// 


Proof  of  Theorem  2.  For  0  £  p  £  1, 

*  1  n  Si 

QJP)  -  (^(P)  -  h~x  l  Z.[siK((Si-p)/hn)  -  Jsx  K((t-p)/hn)dL]. 

i*l  i-1 

* 

When  8^^  >  0,  that  is,  Zi  is  uncensored,  let  be  an  interior  point  ot 

the  interval  (Sj^.S^)  with  probability  one  so  that 


*  S 

K((Si-p)/hn)  -  Jg1  K((t-p)/hn)dt  a.s. 


Then  using  condition  (K.6), 


|Q^(P)  -  <^(P)|  S  h"1  l  ZiSi  |K((S±-p)/hn)  -  K((S*-p)/hn)| 


i-1 


5  r  h^2  l  h  *i'si  -  si*l 

i"l 


r  »;;2  i  h  -i 

i*l 


a.s. 


(A. 4) 


Now,  by  the  continuity  of  FQ  (F.l),  using  the  definition  of  and  Sj, 
(A.4)  can  be  written  as 

IV")  -  V»>l  S  "V  lml  zllVzi>  -  VVK'V 

•  rhn2  ‘  0  ‘  V*'>  :«„<*> 


s  n>'2  r„  »[!?„(*)  -  F„<»)i 


n  J0 


+  I Fq(x”)  -  Fn(x")|]dFn(x) 


S  2Hf2  || Fn  -  Fq|!  xdFn(x)  a.s.,  (A.5) 


where  g(x  )  denotes  the  limit  from  the  left  at  x  of  the  function  g  and 
|!  Fn  -  F0jj  -  sup  |Fn(x)  -  Fq(x)|. 

*  MB  *  * 

Now,  since  P  (x)  *  0  for  x  >  Z  ,  IT  x  d F  (x)  ■  T(Z  ),  and  by  the 
n  n  'O  n  n 

A 

assumptions  of  the  theorem  and  the  LIL  for  F^  of  Foldes  and  Rejto  (1981), 
from  (A.5) 

lQn(p)  ”  *  0((log  logn/n)*  hn2)  a.s.  (A. 6) 

Noting  that  the  right-hand-side  of  (A.5)  does  not  involve  p,  the  conclusion 
of  the  theorem  follows.  Ill 


2 


ACKNOWLEDGMENTS 


The  author  is  grateful  to  Winston  Yang  for  a  preprint  of  his  paper  and 
to  Kai  Yu  for  helpful  discussions  concerning  Theorem  2.  This  research  was 
supported  by  the  United  States  Air  Force  Office  of  Scientific  Research  >nd 
Army  Research  Office  under  grant  number  AFOSR-84-0156. 


REFERENCES 


Breslow,  N. ,  and  Crowley,  J.  (1974),  "A  Large  Sample  Study  of  the  Life  Table 

and  Product  Limit  Estimates  Under  Random  Censorship,"  Annals  of  Statistics 
2,  437-453. 

Cheng,  K.  F.  (1981),  "On  Almost  Sure  Representations  for  Quantiles  of  the 
Product  Limit  Estimator  with  Applications,"  Research  Report  87,  SUNV 
Buffalo,  Dept,  of  Statistics. 

Csorgo^  M.  (1983) >  Quantile  Processes  with  Statistical  Applications,  d'MS  NSF 
Regional  Conference  Series  in  Applied  Mathematics,  SIAM,  Philadelphia,  PA. 

Efron,  B.  (1967),  "The  Two-Sample  Problem  with  Censored  Data,"  Proceeding’  <>) 
the  Fifth  Berkeley  Symposium,  4^,  831-853. 

Falk,  M.  (1984),  "Relative  Deficiency  of  Kernel  Type  Estimators  of  Quantiles," 
Annals  of  Statistics,  12 ,  261-268. 

Foldes,  A.,  and  Rejto,  L.  (1981),  "A  LIL  Type  Result  for  the  Product  Limit 
Estimator,"  Zeitschrift  fur  Wahrscheinlichkeitstheorie  und  Verwande 
Gebiete,  56,  75-86. 

Foldes,  A.,  Rejto,  L. ,  and  Winter,  B.  B.  (1980),  "Strong  Consistency  Prop*  > l ios 
of  Nonparametric  Estimators  for  Randomly  Censored  Data,  I:  The  Prodn-t 
Limit  Estimator,"  Periodica  Mathematica  Hungarica,  11,  233-250. 

Gill,  R.  (1983),  "Large  Sample  Behavior  of  the  Product-Limit  Estimator  on  1  lx 
Whole  Line,"  Annals  of  Statistics,  11 ,  49-58. 

International  Mathematical  and  Statistical  Libraries  (1982),  1MSL,  Inc.,  7500 
Bellaire  Blvd.,  Houston,  TX. 

Kalgh,  W.  D.,  and  Lachenbruch,  P.  A.  (1982),  "A  Generalized  Quantile  Estimator,' 
Communications  in  Statistics-Theory  and  Methods,  11,  2217-2238. 


Kaplan,  E.  L. ,  and  Meier,  P.  (1958),  "Nonparametric  Estimation  from  Incomplete 
Observations,"  Journal  of  the  American  Statistical  Association,  53,  457-  'i81 


22 


McNichols,  D.  T.t  and  Padgett,  W.  J.  (1984),  "A  Modified  Kernel  Dei;  ii y 

Estimator  for  Randomly  Right-Censored  Data,"  South  African  Statist  i ; 
Journal ,  18,  13-27. 

Miller,  R.  G.,  Jr.  (1981),  Survival  Analysis,  John  Wiley  &  Sons,  New  York. 

Nair,  V.  N.  (1984),  "Confidence  Bands  for  Survival  Functions  with  Cense ;r : 

Data:  A  Comparative  Study,"  Technometrics,  26,  265-275. 

Padgett,  W.  J.,  and  McNichols,  D.  T.  (1984),  "Nonparametric  Density  >s  .  don 
from  Censored  Data,"  Communications  in  Statistics-Theory  and  Mei'i-  ,  1J3, 
1581-1611. 

Parzen,  E.  (1962),  "On  Estimation  of  a  Probability  Density  Function  unr  Node," 
Annals  of  Mathematical  Statistics,  33,  1065-1076. 

Parzen,  E.  (1979),  "Nonparametric  Statistical  Data  Modeling,"  Journal  oi  the 
American  Statistical  Association,  74 ,  105-121. 

Reid,  N.  (1981),  "Estimating  the  Median  Survival  Time,"  Biometrika ,  68  Col-608. 

Sander,  J.  (1975),  "The  Weak  Convergence  of  Quantiles  of  the  Product  Lj  ■ 
Estimator,"  Technical  Report  5,  Stanford  University,  Dept,  of  Stall  • 

Susarla,  V.,  and  Van  Ryzin,  J.  (1980),  "Large  Sample  Theory  for  an  Ej.iJi 
of  the  Mean  Survival  Time  from  Censored  Samples,"  Annals  of  Static  ^ 
1002-1016. 

Yang,  S.  S.  (1984),  "A  Smooth  Nonparametric  Estimator  of  a  Quantile  Fir 
Technical  Report,  Kansas  State  University,  Dept,  of  Statistics. 


